# Aphid Segmentation Dataset

A dataset for the semantic segmentation of aphid clusters in sorghum fields. The dataset is made up of 54,742 images and their corresponding masks. From initial high-resolution images, we randomly shuffled and split the images into 10 separate groups from which we generated patches at three separate scales with a 10% overlap. The three scales 0.132Hx0.132W, 0.263Hx0.263, and 0.525Hx0.525W have 36478, 14628, and 3636 images respectively.

## Set Up

### Download the files

To set up the dataset for training, download the zip files, and the script to combine and extract the dataset files.

### Extract

`chmod -x ./combine_and_unzip.sh`
`./combine_and_unzip.sh`

When prompted, please enter the password to complete the unzip process.
